The Mixer and Transcript Reading Corpora: Resources for Multilingual, Crosschannel Speaker Recognition Research

نویسندگان

Christopher Cieri

Walter D. Andrews

Joseph P. Campbell

George R. Doddington

John J. Godfrey

Shudong Huang

Mark Liberman

Alvin F. Martin

Hirotaka Nakasone

Mark A. Przybocki

Kevin Walker

چکیده

This paper describes the planning and creation of the Mixer and Transcript Reading corpora, their properties and yields, and reports on the lessons learned during their development.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Speaker Recognition: Building the Mixer 4 and 5 Corpora

The original Mixer corpus was designed to satisfy developing commercial and forensic needs. The resulting Mixer corpora, Phases 1 through 5, have evolved to support and increasing variety of research tasks, including multilingual and cross-channel recognition. The Mixer Phases 4 and 5 corpora feature a wider variety of channels and greater variation in the situations under which the speech is r...

متن کامل

The MMSR bilingual and crosschannel corpora for speaker recognition research and evaluation

We describe efforts to create corpora to support and evaluate systems that meet the challenge of speaker recognition in the face of both channel and language variation. In addition to addressing ongoing evaluation of speaker recognition systems, these corpora are aimed at the bilingual and crosschannel dimensions. We report on specific data collection efforts at the Linguistic Data Consortium, ...

متن کامل

Resources for new research directions in speaker recognition: the mixer 3, 4 and 5 corpora

This paper describes new language resources designed to support research in speaker recognition. It begins with a brief overview of collections protocols, motivates the shift from the Switchboard protocol to the Mixer protocol, summarizes yields from the earliest phase of Mixer collection and then describes more recent phases, yields and expected yields and lessons learned.

متن کامل

The Mixer Corpus of Multilingual, Multichannel Speaker Recognition Data

This paper describes efforts to create corpora to support and evaluate systems that perform speaker recognition where channel and language may vary. Beyond the ongoing evaluation of speaker recognition systems, these corpora are aimed at the bilingual and cross channel dimensions. We report on specific data collection efforts at the Linguistic Data Consortium and the research ongoing at the US ...

متن کامل

Transcript-dependent speaker recognition using mixer 1 and 2

Transcript-dependent speaker-recognition experiments are performed with the Mixer 1 and 2 read-transcription corpus using the Lincoln Laboratory speaker recognition system. Our analysis shows how widely speaker-recognition performance can vary on transcript-dependent data compared to conversational data of the same durations, given enrollment data from the same spontaneous conversational speech...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2006

The Mixer and Transcript Reading Corpora: Resources for Multilingual, Crosschannel Speaker Recognition Research

نویسندگان

چکیده

منابع مشابه

Speaker Recognition: Building the Mixer 4 and 5 Corpora

The MMSR bilingual and crosschannel corpora for speaker recognition research and evaluation

Resources for new research directions in speaker recognition: the mixer 3, 4 and 5 corpora

The Mixer Corpus of Multilingual, Multichannel Speaker Recognition Data

Transcript-dependent speaker recognition using mixer 1 and 2

عنوان ژورنال:

اشتراک گذاری